Deconstructing the Myth 
of American Public 
Schooling Inefficiency 

Bruce D. Baker 
Mark Weber 



ALBERT SHANKER INSTITUTE 




Deconstructing the Myth 
of American Public 
Schooling Inefficiency 


Bruce D. Baker 
Mark Weber 


June 2016 



ALBERT SHANKER INSTITUTE 


ABOUT THE AUTHORS 


Bruce Baker is Professor in the Department of Educational Theory, Policy and Administration at Rutgers, the 
State University of New Jersey. He specializes in state school finance policies, the economics of education and 
legal and financial issues related to charter schooling. He sits on the editorial boards of the journals Education 
finance and Poli^ and The Journal of Education Finance and has published widely in academic journals. He is also 
lead author of Is School Funding Fair, a national report on state school finance systems in collaboration with the 
Education Law Center of New Jersey. He has consulted widely for state legislatures and has served as an expert 
witness in litigation related to school funding in several states. 

Mark Weber is a PhD student at Rutgers University's Graduate School of Education, concentrating in Policy, 
Organization, and Theory. He is also a music educator in Warren Township, New Jersey. In addition to peer- 
reviewed articles and policy briefs, Weber also writes regularly on education policy for a variety of media out¬ 
lets. His blog, "Jersey Jazzman," is one of the most widely read education blogs in the nation. Weber's research 
interests include charter schools, teacher evaluation, testing, and arts education. 


THE ALBERT SHANKER INSTITUTE, endowed by the American Federation of Teachers and named in honor of its 
late president, is a nonprofit, nonpartisan organization dedicated to three themes—children’s education, unions as ad¬ 
vocates for quality, and both civic education and freedom of association in the public life of democracies. Its mission is 
to generate ideas, foster candid exchanges and promote constructive policy proposals related to these issues. The Insti¬ 
tute commissions original analyses, organizes seminars, sponsors publications and subsidizes selected projects. Its inde¬ 
pendent board of directors is composed of educators, business representatives, labor leaders, academics and public 
policy analysts. 


BOARD OF DIRECTORS 

Danielle Allen 
Paul E. Almeida 
Anthony Bryk 
Carolyn (Cali) Cole 
Linda Darling-Hammond 
Han Dongfang 
Sara Goldrick-Rab 
Ernest Green 
Sarita Gupta 
Andy Hargreaves 
E. D. Hirsch, Jr. 


Sol Hurwitz 
John Jackson 
Clifford B. Janey 
Lorretta J ohnson 
Susan Moore Johnson 
Richard Kahlenberg 
Ted Kirsch 
Carrie Leana 
Stanley S. Litow 
Michael Maccoby 
Herb Magidson 
Harold Meyerson 


Daniel Montgomery 
Susan B. Neuman 
Pedro Noguera 
Mary Cathryn Ricker 
Richard Riley 
William Schmidt 
Randi Weingarten 
Deborah L. Wince-Smith 

EXECUTIVE DIRECTOR 

Leo E. Casey 


This document was written for the Albert Shanker Institute and does not necessarily represent the views of the 
members of its board of directors. 

Copyright © 2016 Albert Shanker Institute. Permission is hereby granted to reproduce and distribute copies of the 
work for nonprofit education purposes, provided that copies are distributed at or below cost, and that the author 
and copyright notice are included on each copy. Any other distribution of these materials is prohibited without first 
receiving express written permission. 


TABLE OF CONTENTS 


Executive Summary 1 

Organization of the Report 2 

American Public Schooling Inefticiency 3 

Missing Measures and Misguided Inferences 4 

What Factors Aftect Education Costs? 6 

Socio-Economic Context 9 

Matching Input and Outcome Measures 17 

Conclusions and Implications 25 

References 27 


Appendix A; Alternative Unconditional, Nominal 


29 




EXECUTIVE SUMMARY 


In this paper, we begin by classifying the arguments that assert American schools are relatively inefficient into two 
categories: the long-term trend argument and the international comparison argument. Our focus herein is on the latter of 
these two. We then describe two frameworks for approaching either of these arguments: cost efficient and production 
ejficieng. We explain that the typical spending/outcome model used to make the case that the United States is a rela¬ 
tively inefficient nation is whoUy unsuitable for drawing these or any conclusions. Accounting for differences in 
student populations is helpful, but stiU inadequate for building a model that can be used to assess a country’s rela¬ 
tive efficiency. Evaluating education inputs such as teacher wages and class sizes can further refine comparisons 
between nations; however, it is unUkely that even these refinements are enough to conduct analyses that can credi¬ 
bly back claims about the relative efficiency of America’s education system. That said, an appropriately limited anal¬ 
ysis can still inform our understanding of how the U.S public education system compares with systems in other 
countries. 

What does this all mean? 

First and foremost, we can say with some confidence that existing expositions of U.S. inefficiency (based on Or¬ 
ganization for Economic Cooperation and Development [OECD] national spending data and Program for Interna¬ 
tional Student Assessment [PISA] scores) are so lacking in methodological rigor that they are of little if any value in 
public discourse or for informing national education policies. 

Second, it is unlikely that we could ever obtain data of sufficient precision, accuracy and comparability to meet the 
demands of more legitimate efficiency modeling for cross-national, intercontinental analyses. 

Any and all comparisons using OECD and related data should be conducted with consideration of the limitations 
discussed herein. But some insights might be drawn from our analyses: 

Among other things, the OECD per-pupil spending measure, as incomparable as it is, shows that the U.S. may 
have higher per-pupil spending than many nations, but falls right in line with expectations for nations of similar 
gross domestic product (GDP) per capita. 

The U.S. is both a high-spending and high-GDP country, but some of that high education spending may be a func¬ 
tion of the scope of services and expenses included under the education umbrella in the U.S. 

We also know that despite seemingly high spending levels in the United States, teachers’ wages lag with respect to 
other professions, and the wage lag is not a result of providing relatively smaller class sizes. 

In fact, our primary class sizes (roughly equivalent to schooling provided from about age 5 through 11 or 12 years 
of age) are average and lower secondary (roughly equivalent to schooling provided from about 12 to 16 years of 
age)^ class sizes large. Our wage lag is, to an extent, a function of high non-teaching wages (related to our high 
GDP per capita), necessarily making it more expensive to recruit and retain a high-quality teacher workforce. 

To summarize: The U.S. is faced with a combination of seemingly high education expense, but noncompetitive 
compensation for its teachers, average to large class sizes, and a high rate of child poverty. Again, it’s hard to con¬ 
ceive how such a combination would render the U.S. comparable in raw test scores to low-poverty nations like Ko¬ 
rea or Finland, or small, segregated, homogeneous enclaves like Singapore or Shanghai.^ 


1 See UNESCO (2012). Note that the 2011 ISCED classification scheme does not specify age ranges, focusing instead on the purposes of 
the levels of education (basic preparation, etc.). But that system is crosswalked to the 1997 scheme which does specific age ranges and 
where the previous (1997) ISCED level 1 and level 2 remain aligned with the present (2011). 

2 Shanghai in particular has several mitigating factors that make comparing its scores to other nations highly suspect; see: http:// 
www.brookings.edu/research/papers/2013/10/09-pisa-china-problem-loveless 
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ORGANIZATION OF THE REPORT 


We begin this policy brief with a short introduction to the long-term trend argument and the international comparison 
argument. We summarize the debunking of the former, then move on to our focus: why claims of inefficiency in 
the United States’ education system that use international comparisons are invalid, and what appropriately lim¬ 
ited analyses of data can tell us about the nation’s schools. 

The remainder of the brief is organized in three sections. First, we address the missing measures and resulting 
misguided inferences of common cross-national efficiency comparisons. We begin with a discussion of more 
rigorous academic literature on educational efficiency analysis, relevant modeling strategies and key factors that 
influence education costs, which are most often omitted entirely in casual cross-national comparisons. Next, we 
discuss additional issues that complicate comparisons of organizationally disparate and geographically distant 
educational contexts. Specifically, we focus on difficulties in accounting for differences in children’s family and 
socio-economic backgrounds, and on differences in the size and organization of education systems in OECD 
countries compared with the U.S. as a whole and compared with individual U.S. states. We conclude with a dis¬ 
cussion of the mismatch between measures of education spending across nations, raising questions about the 
breadth of services and related costs embedded in U.S. education spending figures versus those of other coun¬ 
tries. We conclude with a closer look at specific schooling resources, including teacher compensation and class 
size. 


AMERICAN PUBLIC SCHOOLING INEFFICIENCY 


Persistent arguments that school spending has litde or no relation to school quality are often buttressed by two 
evidentiary claims:^ 

1) Education spending in the United States has doubled if not tripled over the long term, yet commonly meas¬ 
ured education outcomes have remained ‘Virtually flat”; 

2) Education spending in the United States is among the highest in the world, but our commonly measured out¬ 
comes lag well behind other nations. 

We describe these as the long-term trend argument and the international comparison argument.^ Both tend to dominate 
the public discourse, even as more nuanced assessments of how and why money matters exist in academic litera¬ 
ture.*^ 

The validity of these arguments is suspect even at face value if only because accurately measuring the changes in 
test-based outcomes attributable to the effectiveness and efficiency of a nation’s education system or the value of 
the education dollar over time and across contexts is, at best, a complicated endeavor. Measuring changes in test 
scores, across changing cohorts and contexts to determine whether performance has remained “virtually flat” is, 
by itself, no simple task. But even if we set aside these difficulties, the claim of stagnant national progress is not 
held up by the evidence. Rothstein (2011) shows that, in fact, “On these exams [National Assessment of Educa- 


^In a 2011 opinion piece in the Huffington Post titled ‘Tlip the Curve: Student Achievement vs. School Budgets,” Bill 
Gates of Microsoft used both of these claims. Haddad (2014) also uses both claims. 

^Borrowing from Jackson, Johnson & Persico (2015). 

^For a discussion of the sources and recurrence of these arguments, see Baker (2016) and Baker and Weiner (2011). 

6 Baker (2016) identifies three additional common arguments made by those wishing to either downplay or deny outright 
the role of funding in determining schooling quality: (1) tallies, or ‘Vote counts” of correlational studies between spending 
and outcomes, without regard for rigor of the analyses and quality of the data on which they depend; (2) anecdotal asser¬ 
tions that states such as New Jersey and cities such as Kansas City, Mo., provide proof positive that massive infusions of 
funding have proven ineffective at improving student outcomes; and (3) the assertion that how money is spent is much 
more important than how much is available. 
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tional Progress], American students have improved substantially, in some cases phenomenally.” (p. 1) Other re¬ 
search confirms that, when accounting for differences in student disadvantage, U.S students perform much bet¬ 
ter than what is suggested by commonly cited, unadjusted rankings that fail to account for changes in subgroup 
proportions when aggregating test results (Carnoy & Rothstein, 2013). Similarly, as we shall see below, interna¬ 
tional comparisons of education spending that use simple mean per-pupil spending figures compiled by the Or¬ 
ganization for Economic Cooperation and Development are whoUy inadequate to support any claims of the U.S. 
education system’s alleged profligacy. 

Regarding the long-term trend Jackson, Johnson & Persico (2015) rebut the facile nature of assertions 

built exclusively on upward trending spending and supposed “virtually flat” test scores with a clever analogy: 

“... consider the following true statistics: between 1960 and 2000 the rate of cigarette smoking for fe¬ 
males decreased by more than 30 percent while the rate of deaths by lung cancer increased by more than 
50 percent over the same time period. An analysis of these time trends might lead one to infer that 
smoking reduces lung cancer. However, most informed readers can point out numerous flaws in looking 
at this time trend evidence and concluding that ‘if smoking causes lung cancer, then there should have 
been a large corresponding reduction in cancer rates so that there can be no link between smoking and 
lung cancer.’ ” 

The unconditional, contextually detached logic of this example is analogous to the assertion that increased 
spending over time, coupled with flat test scores, proves that school spending matters little. In addition, at least 
the second of these assertions (flat test scores) is false to begin with, and the first (increased spending) is far 
more complicated than it appears. 

The long-term trend argument has been thoroughly rebutted on numerous occasions (Baker, 2016 Baker & Weiner, 
2011; Jackson, Johnson & Persico, 2015; Rothstein, 2011). Yet except for Carnoy and Rothstein’s (2013) dissec¬ 
tion of the test score comparisons, less attention has been paid to the international comparison argument. Essential¬ 
ly, this argument contends that the United States is spending more and performing worse than other countries 
simply because it is less efficient than those countries. We must be doing something wrong, and as a result 
should take efficiency lessons from those who “do more with less.”^ The assertion that the American public edu¬ 
cation system is less efficient than nearly any other education system in the world often goes something like this: 

'^Compared to other countries, America has spent more and achieved less. We need to build exceptional 
teacher personnel systems that identify great teaching^ reward it, and help every teacher get better. 

Bill Gates, Microsoft co-founder 

The evidentiary basis for this claim is most often either built on simple references to U.S. per-pupil spending on 
primary, lower secondary and upper secondary education (Indicator Bl) compared with other nations as report¬ 
ed in the OECD Education at a Glance series, coupled with scale scores of U.S. students on the Program for Inter¬ 
national Student Assessment (PISA).^ We spend high according to OECD, score low according to PISA, and 
thus are inefficient. Marginally more rigorous analyses plot PISA scores against national average per-pupil spend- 


^More extreme versions of this argument go so far as to assert that “the United States spends more on schools than any society in human 
history,” https://higheredrevolution.com/the-united-states-spends-more-on-schools-than-any-society-in-human-history- 

d5988649d73e#.lik6bfskq 

^Gates (2011) http://www.huffingtonpost.com/bill-gates/bill-gates-school-performance b 829771.html The graph Gates includes with 
this piece, which purports to show the U.S. has increased education spending dramatically with little improvement in test scores, was cri¬ 
tiqued here: Charts (2011) Bill Gates should hire a statistical advisor.” http://iunkcharts.typepad.com/junk charts/2011/04/bill- 

gates-should-hire-a-statistical-advisor.html 

^See for example: http://www.ncee.org/2015/01 /statistic-of-the-month-education-performance-equity-and-efficiency/. which explains 
“Several countries have high scores on PISA and achieve high levels of equity in student performance while spending relatively little per 
student” as a basis for comparing “efficiency.” 
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ing estimates (annual, or cumulative for ages 6 to 15), showing the U.S. among the world’s higher-spending na¬ 
tions, with no additional yield in PISA scores, and an overall weak relationship between spending and test scores 
across nations (See OECD, 2012)do 


MISSING MEASURES AND MISGUIDED INFERENCES 

Understanding Efficiency Analysis in Education 

There exists a relatively large body of empirical literature on applied efficiency analysis in education (Grosskopf, 
Hayes & Taylor, 2014); that is, the very kind of methods and models that might be used to distill whether one 
institution, organization or jurisdiction more efficiently produces education outcomes than another. The same 
basic principles and methods apply whether we are evaluating individual schools, local education agencies 
(public school districts in the U.S. model of schooling), state school systems or national education systems. The 
main difference is that as we move to more and more geographically, economically and culturally distant and 
distinct systems, constructing consistent measures of schooling inputs and outcomes (while considering con¬ 
founding factors) becomes far more difficult yet far more important. 

Efficiency analysis in education (or any sector) may be framed from a cost perspective or production perspective. Cost 
efficiency and production efficiency are flip sides of the education spending coin. Each involves identifying out¬ 
comes, spending toward achieving those outcomes, and various observable conditions—cost factors—that af¬ 
fect the spending —outcome relationship. However, each seeks to answer different questions: 

• Cost Efficiency: Given the outcomes a unit (school) currently achieves, compared to the lowest-spending 
unit achieving the same (aU else equal), how much does the given unit spend? 

• Production Efficiency: Given the current spending levels (and other factors), how do the outcomes of the 
unit compare to the maximum outcomes achieved with comparable spending (and other factors)? 

Figure 1 presents a view of cost efficient, across schools of varied scale, or enrollment size. The “cost” of produc¬ 
ing a given level of outcomes varies by the scale of the school, where very small schools face much higher costs 
to achieve the same outcomes as scale-efficient schools. A subset of schools may faU along the lower boundary, 
the underlying “cost frontier” or minimum expense, at a given school size, at which the desired outcomes can be 
produced. Most schools wiU fall some distance above that frontier, or be spending somewhat more than the 
minimum to achieve the same outcomes. Some of those differences may be attributed to factors weVe simply 
missed in our data and models—because they weren’t measured or were measured poorly; this is partly why we 
shouldn’t live by the data and models alone. Some of those differences may also include expenditures deemed 
valuable by constituents, but which don’t contribute directly to the outcome measures in question. Valuable les¬ 
sons may be learned from both exploring those schools that fall along the cost frontier and those that deviate 
from it. 

Figure 2 presents the production efficient perspective. Here, it is assumed that for each additional doUar-input to 
schooling, there is a commensurate gain in student outcomes, with diminishing returns. Some schools will faU 
along—or define—the frontier, or maximum output for any given level of expense, and others wiU fall below 
that frontier, for the same types of reasons that schools deviate from the cost frontier. 


a review of 43 “high quality” studies of schooling resources in developing nations, Glewwe, Hanushek, Humpage & Ravina (2011) 
suggest that perhaps among less-developed nations increases in basic resources (roof over head, desk in which to sit) do associate with 
improved school quality, but among more-developed higher-spending nations, the spending-to-outcome relationship breaks down for a 
variety of reasons. 
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Measured Outcomes Per-pupil Cost of Common Outcomes 


Figure 1. Cost Efficiency Framework 



Figure 2. Product Efficiency Framework 



Spending per Pupil 
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The visuals here are stylized for simplicity, reducing efficiency analysis to two dimensions. No credible scholar 
or analyst would consider a simple, unconditional cross-sectional analysis of the relationship between nominal 
per-pupil spending and average/aggregate test score levels as a legitimate basis for making “efficiency” compar¬ 
isons. That is, a simple scatterplot of the relationship between per-pupil spending (nominal, unadjusted in time 
or space across units) on the horizontal axis and test scores on the vertical axis is relatively useless for inferring 
the spending-to-outcomes relationship or making judgments about relative efficiency. This is true for compari¬ 
sons across schools within a jurisdiction, across local public school districts within a state and across U.S. states. 
In addition, it is certainly true for cross-national comparisons. Unfortunately, this is as far as many cross¬ 
national comparisons go when attempting inferences about relative efficiency. 

WHAT FACTORS AFFECT EDUCATION COSTS? 

Figure 3 summarizes factors that affect education costs from one school to the next, one district or state to the 
next, and one nation or continent to the next. We reiterate that student outcomes in this model are an input 
The amount a school system spends is determined, among other factors, by the education outcomes that sys¬ 
tem seeks to achieve. Once we step outside comparisons of schools operating within a single labor market of 
consistent geography (no huge swings in population density, major geographic barriers, etc.), simple compari¬ 
sons of education spending from one unit to the next are no longer simple. The value of the education dollar 
varies toward purchasing even the same quality and quantity of education inputs, where the key inputs to educa¬ 
tion are teachers, and where the wage required to recruit and retain teachers of specific qualifications varies 
from one location and setting to the next. 

One of the most significant factors affecting the “cost” of providing comparable educational services across 
schools and districts is economies of scale, or average school size, which may be constrained by geography and 
population density (Andrews, Buncombe & Yinger, 2002). Very small schools that are geographically infeasible 
to consolidate with other schools necessarily operate with much lower staffing ratios (number of teachers and 
other staff to number of students), and have to maintain a basic level of overhead (buildings, grounds, utilities, 
administration) elevating per-pupil operating costs substantially. Public school districts in remote rural areas, 
serving around 300 pupils, tend to operate at nearly double the per-pupil costs of districts with 2,000 or more 
pupils in more population-dense areas, all else being equal. Further, when comparing education outcomes, one 
must consider various attributes of the student populations being served, to the extent that those attributes cor¬ 
relate with outcomes. 

Credible models comparing school or district efficiency across U.S. schools and districts also typically consider 
student disability status (total numbers and severity levels), language proficiency status, indicators of child/ 
family income and poverty, and sometimes race/ethnicity of student population (Baker, 2011). 

Figure 3, which presents the “cost model” perspective, also includes a reference to “inefficiency.” As suggested 
in Figures 1 and 2, inefficiency is the difference between what was actually spent versus the minimum that 
needs to be spent to achieve a given level of outcomes. A separate body of literature has addressed attributes of 
public jurisdictions (jurisdictions with control over public budgets, taxing and spending) that may be associated 
with inefficiency (Borge, Falch & Tovmo, 2008; Grosskopf, Hayes, Taylor & Weber, 2001). In particular, 
measures of interjurisdictional competition and “public monitoring” (often measured by extent of proximal/ 
local involvement in finance and decision-making) have been identified as leading to greater efficiency. Ineffi¬ 
ciency, on the other hand, is more likely to be associated with greater fiscal capacity; in other words, those who 
can spend more (even for constant measured outcomes) are more likely to do so. 


iiDuncombe and Yinger (2007) explain that the “cost model” perspective is more useful for sorting out cost vs. inefficiency because it per¬ 
mits (more logically than the production model) the inclusion of measures of fiscal capacity and public monitoring to more precisely isolate 
“cost.”As Baker and Levin (2014) explain, much of this seemingly inefficient spending involves jurisdictions with resources providing 
“extras” that matter to their constituents, even if they don’t affect testing outcomes. Statistically, this spending might be characterized as 
“inefficient,” but that does not mean it is unimportant, or considered so by the constituents who supported that spending. 
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Figure 3. Factors Affecting Education Costs 
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Source: Baker Levin (2014) 


A thorough applied production-efficiency analysis in education would estimate the gains in student outcomes —^in 
other words, the value added over what students enter with ^^—^ls a function of: 

a) the resources expended on comparable/relevant services, 

b) the geographic factors affecting structural costs and input prices, and 

c) the student characteristics that might have exerted exogenous influence on achievement gains, 
including disability status, language proficiency and child poverty. 


By contrast, in cost modeling, we predict the spending levels (as the dependent variable) associated with achiev¬ 
ing given levels of student outcomes, controlling for factors that affect the value of the education dollar toward 
contributing to outcomes, and additionally correcting for factors in Figure 3 that may explain differences in inef¬ 
ficiency across institutions or jurisdictions (Baker, 2016, pp. 12-14). Typically the models used would be estimat¬ 
ed to schools or districts as units using multiple years of annual data, to ensure stable, reliable estimates 
(Gronberg, Jansen & Taylor, 2012). Even then, our ability to precisely, consistently identify more- and less- 
efficient schools, districts, states, or even countries is suspect due to the imprecision of the data used to create 
the models, and the many omitted variables that might bias those models (see Bifulco & Bretschneider, 2001; 
Buncombe & Bifulco, 2002). 


Commonly Invoked International Comparisons 

Now that weVe established what a credible production or cost analysis should include, let’s take a look at com¬ 
monly cited evidence behind the claim of U.S. inefficiency, starting (and ending) with what is actually one of the 
more rigorous analyses. In the report Does Money Buj Strong Performance in PISyi? (OECD, 2012), the authors use 
two simple frameworks to evaluate a nation’s educational efficiency: the relationship between both national 
gross domestic product and PISA scores, and measures of per-pupil expenditure and PISA scores. They find 
that for lower-spending nations, there does indeed appear to be a relationship between GDP and outcomes, as 
well as a relationship between spending and outcomes. For wealthier, higher-spending nations, however, this 
relationship falls apart. 


^2 Alternatively, one could focus on levels of student outcomes to the extent that one sufficiently captures student background characteristics 
predictive of students’ initial performance. 
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Figure 4 presents our own re-creation of the cross-national spending to PISA score relationship, fit with a log- 
Hnear (diminishing returns) curve (we present alternative versions in Appendix A). Note that this is the very kind 
of graph/analysis we earlier characterised as insufficient because it does not take into account student factors 
(individual and collective), geographic factors or structural factors. But even when using this simplistic model, it 
would appear that the United States is only slighdy below the curve and certainly not the inefficient standout of 
Austria or Luxembourg. 


Figure 4. The Unconditional PISA Production Function 



Y = 46.332ln{x) +75.503 


y = 33.398ln(x) + 189.44 


♦ Math Mean 2009 
■ Reading Mean 2009 


OECD (2012), Education at a Glance 2012: OECD Indicators, OECD Publishing, http://dx.doi.org/10.1787/eag-2012-en 

Indicator Bl.la: http://dx.doi.org/10.1787/888932665867 

OECD (2010), PISA 2009 Results: What Students Know and Can Do - Student 

Performance in Reading, Mathematics and Science (Volume I) http://dx.doi.org/10.1787/9789264091450-en 


Figure 4 above appears on the surface to confirm the classic economic expectation of positive but diminishing re¬ 
turns in education outcomes to marginal increases in education spending. But this seemingly logical pattern is 
bound up in complex, circular or “endogenous” relationships. Notably, the 2012 OECD report finds nearly the 
same pattern for the relationship between GDP and PISA as for per-pupil spending and PISA. This is unsurprising 
because, as shown in Figure 5, wealthier nations spend more on education. So children in wealthier nations score 
better on PISA (up to a point), while children in higher-spending nations also score better on PISA (up to a point). 
But because education spending and GDP are correlated, the seemingly logical relationship—the “production 
curve” shown in Figure 4—actually tells us litde to nothing about a nation’s relative efficiency. 


What we see here is not reaUy a production function at aU, but rather a pattern that appears because of aU of the 
coinciding relationships underlying the data. For this reason, measuring “efficiency” against this curve is a suspect 
endeavor. Unlike legitimate production, cost and efficiency analysis, these analyses consider only a single year of 
cross-sectional data, and only levels of student outcomes; they do not adequately model the contribution of a na¬ 
tion’s education policies and practices to gains in their students’ outcomes. 


Alternatively, if we really wanted to make the U.S. look bad, we could fit a linear model to the data, which would produce a straight trendline, 
rather than the better fitting and more appropriate log-linear model (which produces a curved trendline) that we use here. Using a linear model 
increases the distance from the trendline for the U.S. and makes the country appear to be even more inefficient. Numerous such examples exist in 
the blogosphere, including: http://excelined.org/2013/09/20/things-heaven-earth-dr-ravitch-dreamt-ideology-ravitch-vs-reality-part-iii/ or here: 
http://bpr.berkeley.edu/2014/10/29/a-finnish-ed-model-for-national-education/ . These models are made even more useless to support the claim 
of alleged U.S. inefficiency by their exclusion of low-spending, low-outcome countries, which create the initial “bend” in the log-linear trend line. 
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Figure 5. National Fiscal Capacity and Education Spending 



$0 $10,000 $20,000 $30,000 $40,000 $50,000 $60,000 $70,000 $80,000 $90,000 

GDP per capita (in equivalent USD converted using PPPs)l 


OECD (2010), Table 1.2.20. Socio-economic indicators and the relationship with performance in reading, in PISA 2009 Results: What Students 
Know and Can Do, OECD Publishing, Paris. 

DOI: http://dx.doi.org/10.1787/9789264091450-table76-en 

Comparing Disparate and Distant Contexts 

If we wish to truly measure a country’s educational efficiency, we must move beyond simple spending/ 
outcomes correlations. For many reasons, however, it is nearly, if not entirely, impossible to estimate a legiti¬ 
mate cross-national education production or cost model. It’s difficult enough, if not implausible, to estimate 
such a model for America’s individual states given the vast structural and geographic differences among them. 
There are also significant problems with consistently measuring the value of the education dollar from New 
York to New Mexico (Taylor, 2006) and problems with consistently measuring student need factors, including 
child poverty, from one region to the next (Baker, Taylor, Levin, Chambers & Blankenship, 2013). The difficul¬ 
ties in addressing these issues are compounded when the frame of comparison moves to the international level. 

But the impossibility of achieving an adequate cross-national cost or production model should not impede us 
from at least exploring additional covariates and patterns that raise even more questions about common asser¬ 
tions of American inefficiency. For example, how much do available (albeit limited) measures of student need, 
including economic status, affect national average/aggregate PISA scores? To the extent that they do at all, they 
must be considered when making judgments about the relative efficiency of U.S. schools. 


SOCIO-ECONOMIC CONTEXT 

When constructing cost or production models for evaluating school or district efficiency, the goal is to find 
those measures of student attributes that are sufficiently exogenous—that is, do not have a circular relationship 
with the outcomes measured—and that, with reasonable causal explanation, are substantively correlated with 
the education outcome measure under investigation. There are, for example, many approaches to measuring 
poverty or socio-economic status. Some measures, like the U.S. Department of Education’s “free or reduced- 


Albert Shanker Institute | 9 














price lunch” metric, are crude, categorical variables. Other dichotomous measures, such as under/over the pov¬ 
erty line, will have differing thresholds, but the same limitations. Some measures characterise poverty in relative 
terms, with respect to the income distribution or some point within it; others measure poverty with respect to 
specific, absolute income levels (Coley & Baker, 2013). Due to the complexities of establishing specific, compa¬ 
rable income thresholds associated with “poverty” status (which are necessary for “absolute” poverty measure¬ 
ment), cross-national comparisons often use relative poverty measures, such as the share of children in families 
with less than half the median income. 

Figure 6 shows that even these seemingly less-precise measures of relative poverty are reasonably strongly asso¬ 
ciated with PISA mathematics literacy in 2012. The cross-national correlation is greater than 0.60 (R^ = 0.3638). 
In Figure 6, it would appear that the U.S. (on this one outcome measure) does better than expected, given its 
relative child poverty rate: The nation is above the trendline, indicating it outperforms this simple model’s pre¬ 
diction. 


Figure 6. PISA and Relative Poverty 



0.000 5.000 10.000 15.000 20.000 25.000 30.000 

Share of Children in Families with less than 50% Median Income 


PISA Data: http://nces.ed.gov/pubs2014/2014024_tables.pdf (table M4) 

OECD Relative Poverty: Source: Provisional data from OECD Income distribution and poverty database {www.oecd.org/els/social/inequality). 


Table 1 presents the correlations across a handful of measures compiled for a 2010 OECD report exploring 
relationships between socio-economic indicators and reading performance. The two strongest correlations with 
reading outcomes are for the “proportion of the population in the age group between 35 and 44 years with ter¬ 
tiary education,” and “share of students in their country whose PISA index of economic, social and cultural sta¬ 
tus is below -l.”^4 Notably, adult education levels, and child disadvantage levels are also associated with GDP 
(0.41 and -0.48), and accordingly with the spending measure (in this case, cumulative spending per student from 


14 “The Programme for International Student Assessment (PISA) index of economic, social and cultural status was created on the basis of 
the following variables: the International Socio-Economic Index of Occupational Status (ISEI); the highest level of education of the stu¬ 
dent’s parents, converted into years of schooling; the PISA index of family wealth; the PISA index of home educational resources; and the 
PISA index of possessions related to 'classical’ culture in the family home.” See: https://stats.oecd.org/glossary/detail.asp?ID=5401 . See 
also Ganzeboom (2010) for information on the ISEI. 
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age 6 to 15). In other words, a multitude of economic capacity, education spending, and student/family back¬ 
ground characteristics are modestly related across nations (where the number of nations included in these calcula¬ 
tions is only 34). 


Table 1. PISA and other SES measures 


Indicator 

Mean per¬ 
formance on 
the reading 
scale 

GDP per 
capita (in 
equivalent 
USD con¬ 
verted using 
PPPs)i 

Cumulative 
expenditure 
per student 
between 6 
and 15 years 
(in equivalent 
USD convert¬ 
ed using 

PPPs)i 

Percentage of 
the popula¬ 
tion in the 
age group 35- 
44 years with 
tertiary edu¬ 
cation^ 

Proportion 
of 15-year- 
olds with an 
immigrant 
background 

Share of stu¬ 
dents in their 
country whose 
PISA index of 
economic, so¬ 
cial and cultural 
status is below - 
1 

GDP per capita (in 
equivalent USD convert¬ 
ed using 

PPPs)i 

0.24 






Cumulative expenditure 
per student between 6 
and 15 years (in equiva¬ 
lent USD converted us¬ 
ing PPPs)^ 

0.30 

0.93 





Percentage of the popu¬ 
lation in the age group 
35-44 years with tertiary 
education^ 

0.67 

0.41 

0.43 




Proportion of 15-year- 
olds with an immigrant 
background 

0.11 

0.69 

0.63 

0.38 



Share of students in their 
country whose PISA 
index of economic, social 
and cultural status is 
below -1 

-0.68 

-0.48 

-0.55 

-0.58 

-0.33 


She of the 15-year-old 
student population 

-0.05 

0.01 

0.03 

0.12 

0.01 

0.21 


[1] Source: OECD, 'Education at a Glance 2010: OECD Indicators. OECD (2010), Table 1.2.20. Socio-economic indicators and the relationship with perfor¬ 
mance in reading, in PISA 2009 'Results: What Students 'Know and Can Do, OECD Publishing, Paris. DOI: http://dx.doi.org/10.1787/9789264091450-table76- 
en 
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Figure 7 shows the relationship between reading performance and the combined SES (socio-economic status) 
index, revealing a correlation of 0.64 (r-squared of 0.4097), or slightly stronger than the relationship between 
national relative poverty levels and math performance in 2012. Here, the U.S. falls slighdy below expectations 
but is certainly no underperforming (or overperforming) outlier. In any case, because measures of socio¬ 
economic status are significantly correlated to student outcomes, failure to consider these SES index or relative 
poverty measures when making assertions of relative efficiency across nations is a major oversight. 

Figure 7. PISA Reading and Combined SES Index 



-2.000 -1.500 -1.000 -0.500 0.000 0.500 1.000 1.500 

Average Socio-Economic/Resource Index 


OECD (2010), Table 1.2.20. Socio-economic indicators and the relationship with performance in reading, in PISA 2009 Results: What Students 
Know and Can Do, OECD Publishing, Paris. 

DOI: http://dx.doi.org/10.1787/9789264091450-table76-en 


51 Independently Financed and Operated Education Systems 

It is also important to recognize that the United States education system is, in fact, 51 largely independent edu¬ 
cation systems, where the majority of funding comes from state and local sources, and where accountability 
systems are adopted by states in compliance with federal statutes and regulations that allow for significant dif¬ 
ferences in funding, policies and practices. States also vary widely in measures of socio-economic status, includ¬ 
ing child poverty rates. The sheer size of the United States alone contributes to the heterogeneity of the coun¬ 
try’s student population. In the OECD 2010 report, for example, the U.S. education system is reported as serv¬ 
ing by far the largest total number of 15-year-olds (at nearly 3.4 million) with Mexico second (at just over 1.3 
million). So, in the aggregate, the U.S. system is large and diverse. But perhaps more important, the U.S. system 
is really a collection of 51 separate systems. Frequently cited “high-performing” nations like Finland serve only 
61,000 15-year-olds (1.8 percent of U.S. 15-year-olds); Korea enrolls 630,000 15-year-olds—18.7 percent of 
U.S. students that age. These other systems are much smaller in magnitude and tend to be more highly central¬ 
ized. There are also significant differences in the cultural, racial and linguistic diversity of different countries; 
arguably, the United States is more diverse than many “high-performing” nations (Alesina et ak, 2002; Fearon, 
2003), although determining the extent of these differences is a complex endeavor. 


Some have argued that the United States is relatively less diverse than many other OECD countries; see: http://educationbythenumbers. 
org/content/top-us-students-fare-poorly-international-pisa-test-scores-shanghai-tops-world-finland-slips 693/ . Determining the relative 
diversity of different countries is, however, quite complicated, as our sources in this paragraph demonstrate. At the very least, any attempt to 
introduce diversity as a variable in a model of international comparisons on test scores should explain how diversity indices were calculated. 


12 I Albert Shanker Institute 

















Because the states vary so significantly, we gain additional insights from those few reports that make efforts to 
compare individual U.S. states with foreign nations. Figure 8 is based on data from a 2012 OECD report that in¬ 
cluded PISA data and constructed the economic, social, and cultural status index for nations and three states— 
Connecticut, Florida and Massachusetts. Connecticut and Massachusetts are relatively affluent Northeastern states 
with relatively high per-pupil spending averages. Florida is a higher-poverty, much lower-spending Southern state 
(Baker, Sciarra & Farrie, 2015). Figure 8 shows that for all students, on average, Massachusetts and Connecticut 
students beat OECD averages while Florida students did not. Massachusetts and Connecticut students on average 
perform more similarly to students in Austria, Belgium and Germany, whereas Florida students perform more 
similarly to those in Croatia. 

Figure 8. Select U.S. States and National PISA Comparisons [All Students] 
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NOTE: The PISA index of economic, social and cultural status (ESCS) was created using student reports on parental occupation, the highest level 
of parental education, and an index of home possessions related to family wealth, home educational resources and possessions related to 
"classicar culture in the family home. The home possessions relating to "classical" culture in the family home included possessions such as 
works of classical literature, books of poetry, and works of art (e.g. paintings). The OECD average is the average of the national averages of the 
OECD member countries, with each country weighted equally. Standard error is noted by s.e. Italics indicate non-OECD countries and education 
systems. Results for Connecticut, Florida, and Massachusetts are for public school students only. 

SOURCE: Organization for Economic Cooperation and Development (OECD), Program for International Student Assessment (PISA), 2012. 


Even within relatively high-performing states like Massachusetts, however, there is significant variation in the so¬ 
cio-economic status of the student population. Figure 9 compares the math literacy scale scores for children in the 
top socio-economic quartile within their jurisdiction. Top quartile students in Massachusetts are outperformed by 
only select Chinese enclaves, Singapore and Korea. Connecticut is not far behind. But Florida, where even the top 
quartile is less well off, performs similarly to the Russian Federation and Sweden. 
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Figure 9. Select U.S. States and National PISA Comparisons [Top National/State Quartile ESCS 
Index] 


Top quarter 



NOTE: The PISA index of economic, social and cultural status (ESCS) was created using student reports on parental occupation, the highest level 
of parental education, and an Index of home possessions related to family wealth, home educational resources and possessions related to 
"classIcaT culture in the family home. The home possessions relating to "classical" culture In the family home included possessions such as 
works of classical literature, books of poetry, and works of art (e.g. paintings). The OECD average is the average of the national averages of the 
OECD member countries, with each country weighted equally. Standard error is noted by s.e. Italics indicate non-OECD countries and education 
systems. Results for Connecticut, Florida, and Massachusetts are for public school students only. 

SOURCE: Organization for Economic Cooperation and Development (OECD), Program for International Student Assessment (PISA), 2012. 


The next few figures put these three U.S. states into context among the other states using analyses similar to 
the previous simple cross-national pseudo-production function analyses. Again, these analyses simply com¬ 
pare spending and outcomes; they do not account for student, geographical or structure differences. Figure 10 
shows per-pupil spending and National Assessment of Educational Progress (NAEP) Reading Grade 8, and 
Figure 11 shows per-pupil spending and NAEP math Grade 8 for U.S. states in 2013. Adopting the facile log¬ 
ic of the common cross-national comparison, one might assert that Alaska and New York are woefully ineffi¬ 
cient, whereas Massachusetts and New Jersey are far more efficient, as are Idaho and Utah. Clearly though, 
there are as many missing pieces to this relationship as there were to the cross-national patterns. The math 
and reading pseudo-production function curves are similar, and the states are in similar positions. 


In another useful exposition found on the Economic Industry USA View blog, http://economyindustryusa.blogspot.com/2011 /01 / 
relationship-between-education-spending.html . the author combines student population adjustment, and U.S. state and OECD spending 
and outcome measures to show the position of a handful of U.S. states placed in international context. 


14 I Albert Shanker Institute 




























































































































Figure 10. The Unconditional Production Function for U.S. States [Reading Grade 8] 



$6,000 $8,000 $10,000 $12,000 $14,000 $16,000 $18,000 $20,000 $22,000 

Per Pupil Spending 

NAEP Data: http://nces.ed.gov/nationsrepoii;card/naGpdata/datasGt.aspx 
State Spending Data: http://www.CGnsus.gov/govs/school/ 


Figure 11. The Unconditional Production Function for U.S. States [Math Grade 8] 



$6,000 $8,000 $10,000 $12,000 $14,000 $16,000 $18,000 $20,000 $22,000 

Per Pupil Spending 

NAEP Data: http://nces.ed.gov/nationsreportcard/naepdata/dataset.aspx 
State Spending Data: http://www.census.gov/govs/school/ 


One of the biggest overlooked factors that simultaneously influences both state spending levels and state aver¬ 
age outcome levels is the economic status of families and children across states. Figures 12 and 13 show the 
relationship between state child poverty rates and state average scale scores. Like the cross-national relationship, 
and even more so, these relationships are strong, and cannot possibly be ignored in making judgments about the 
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relative efficiency of state systems. Connecticut and Massachusetts are relatively low-poverty, high-performing 
states; Florida, in contrast, is a higher-poverty, lower-performing state that also spends much less on its schools. 

Figure 12. Poverty and NAEP Outcomes (Reading Grade 8—2013) 


Reading Grade 8 



NAEP Data: http://nces.ed.gov/nationsreportcarcl/naepdata/dataset.aspx 

Poverty Data: http://www.census.gov/did/www/saipe/data/statecounty/data/index.html 


Figure 13. Poverty and NAEP Outcomes (Math Grade 8—2013) 
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Poverty Percent, Age 5-17 in Families 


NAEP Data: http://nces.ed.gov/nationsreportcard/naepdata/dataset.aspx 

Poverty Data: http://www.census.gov/did/www/saipe/data/statecounty/data/index.html 


16 I Albert Shanker Institute 





















As noted above, Connecticut and Massachusetts compare favorably with high-performing OECD nations, 
whereas Florida does not. But this finding is neither sufficient basis for lauding the achievements of Connecti¬ 
cut and Massachusetts, nor for condemning the failures of Florida, at least with respect to relative efficiency. A 
model that accounts for differences in student characteristics is certainly an improvement on simple cost/ 
outcome comparisons; however, it is still inadequate to the task of measuring efficiency. 

To summarise, when we closely explore U.S. state systems more comparable in magnitude to many national 
systems, we find that some U.S. states fare well in international comparisons while some do not. We also find 
that those states which do compare favorably tend to have lower rates of child poverty and higher rates of edu¬ 
cation spending—^where, like countries, economic conditions simultaneously affect the ability to raise and 
spend money on schools as well as the conditions in which children live and the outcomes they achieve. Given 
that most education policy—curricular policy, accountability frameworks and financing—^varies across states, it 
makes more sense to evaluate and compare our state systems in international context, but only while consider¬ 
ing each of the caveats laid out in the first section and in the next section of this brief. 


MATCHING INPUT AND OUTCOME MEASURES 

A major shortcoming of analyses purporting to evaluate cross-national relative efficiency of education systems 
regards the measurement of fiscal inputs to the education system. PISA, as well as other assessments, including 
the Trends in International Mathematics and Science Study (TIMSS), provides some reasonable standardization 
of outcome measures. But national education systems vary widely. Their governance varies widely. The scope 
of services and related expenditures covered under the umbrella of “education” vary widely. If one wished to 
do a legitimate evaluation of the relative efficient of producing math and reading outcomes, then one would have to pre¬ 
cisely identify the services intended to produce those outcomes, and isolate the expenditures on those services. 
Generously, if we assume that the core services within our own national education system and other systems have, 
as a central objective, improving math and reading outcomes, then we could at least focus on those “core ser¬ 
vices” whatever they may be. 

But even that task is complicated in the human and capital resource intensive process of delivering, at large 
scale, public (and publicly subsidized private) primary and secondary education services. Across nations, gov¬ 
ernance models and financial reporting systems, there exists: 

• Inconsistent governance and expenditure of employee health and pension benefits, 

• Inconsistent governance and expenditure of related health services and other disability services, 

• Inconsistent “coverage” of various other educational (and related) service components 
(extracurricular activities, transportation, food, etc.). 

So, for example, in nations where employee healthcare and pensions are nationalized through separate agencies, 
education spending may appear reduced. Where disability services are covered through other agencies, educa¬ 
tion spending may appear reduced. The same for transportation, food service or various other activities embed¬ 
ded or separated from “schooling.” Further, personal and family expenditures on supplemental programs may 
affect PISA or TIMSS outcomes, but may not be accounted for as school spending. Heyneman (2013) explains, 
for example, the amount of time and personal expense incurred by Korean families (not on the school expens¬ 
es) to support math achievement.^^ 

The commonly used OECD per-pupil spending figures fail to accurately isolate comparable educational ser¬ 
vices and relevant, comparable expenditures on those services; they are, therefore, of minimal (if any) use for 


http://educationnext.ofg/obvious-flaws-obviate-new-education-efficiency-index/ also discusses Korean supplemental spending. 
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cross-national efficiency evaluation. The figures are also not appropriately adjusted to account for input price 
variation; in other words, the purchasing power of the education dollar varies across and within nations. The 
most appropriate adjustments would account for the competitive wage required to recruit and retain similarly 
qualified teachers, as constructed for U.S. states (and labor markets within states) by Taylor (2006). The relative 
competitiveness of teacher wages matters greatly to the overall quality of entrants to the workforce (and stayers), 
and thus to the quality of schooling students receive (Baker, 2016). Finally, no attempt is made in the OECD per 
-pupil spending figures to adjust for other geographic factors, including population sparsity/remoteness 
(proportions of populations served under varied conditions). Put bluntly: If we wouldn’t compare per-pupil 
spending in Salina, Kan., and New York City without making the fuU range of appropriate adjustments, then we 
shouldn’t compare Croatia and the United States without doing so either. 

A Closer Look at Input Components 

While per-pupil spending figures reported by OECD are especially problematic, some insights may be gained by 
comparing the core elements of educational service provision—specifically information on teacher compensa¬ 
tion, teacher characteristics, teacher quantities (class sizes, overall staffing ratios) and teaching/school time. 
While the dollar value of teacher compensation should not be compared directly across contexts, the relative 
position of teachers in the labor market (compared to similarly educated, same-age peers) can be a useful indica¬ 
tor of the adequacy of teacher wages for recruiting and retaining a high-quality workforce. 

A substantial body of literature validates the conclusion that teachers’ overall wages and relative wages affect the 
quality of the individuals who choose to enter the profession, and whether they stay once they get in. For exam¬ 
ple, Murnane and Olsen (1989) found that salaries affect the decision to enter teaching and the duration of the 
teaching career, while Figlio (1997, 2002) and Ferguson (1991) concluded that higher salaries are associated with 
more qualified teachers. In addition, more recent studies have tackled the specific issues of relative pay noted 
above. Loeb and Page (2000) showed that: 

'^Once we adjust for labor market factors, we estimate that raising teacher wages by 10 percent reduces high 
school dropout rates by 3 percent to 4 percent Our findings suggest that previous studies have failed to produce 
robust estimates because they lack adequate controls for non-wage aspects of teaching and market differences in 
alternative occupational opportunities. ” 

In short, although salaries are not the only factor involved, they do affect the quality of the teaching workforce, 
which in turn affects student outcomes. 

Research on the flip side of this issue—evaluating spending constraints or reductions—reveals the potential 
harm to teaching quality that flows from leveling down or reducing spending. For example, Figlio and Rueben 
(2001) note: “Using data from the National Center for Education Statistics we find that tax limits systematically 
reduce the average quality of education majors, as well as new public school teachers in states that have passed 
these limits.” 

Salaries also play a potentially important role in improving the equff of student outcomes. While several studies 
show that higher salaries relative to labor market norms can draw higher-quality candidates into teaching, the 
evidence also indicates that relative teacher salaries across schools and districts may influence the distribution of 
teaching quality. For example, Ondrich, Pas and Yinger (2008) find that “teachers in districts with higher salaries 
relative to non-teaching salaries in the same county are less likely to leave teaching and that a teacher is less likely 
to change districts when he or she teaches in a district near the top of the teacher salary distribution in that 
county.” Finally, a recent study by Britton and Proper (2016) on schools in England found that a 10 percent in¬ 
crease to the wage gap between teachers and non-teachers was associated with a 2 percent reduction in assessed 
outcomes in the key exams taken at the end of compulsory schooling in England. 

Notably, while relative wages are a useful indicator for understanding education expenditures, it would be as fal¬ 
lacious to draw assertions from a simple relationship between nominal teacher salaries and PISA scores as it is to 
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use simplistic spending/outcome comparisonsd^ Even within the personnel component of school budgeting, 
wages are only part of what drives costs. Personnel expenses in education are a function of the cost of staffing 
(wages) and staffing quantities, typically reflected in pupil-to-teacher ratios and class sEes. Thus, it is useful to 
explore the two together. In schools, districts, states or nations with relatively greater total resources to spend, 
there exists greater flexibility to provide both competitive wages and smaller class sizes. As resources become 
scarce, tradeoffs become necessary. Because wages of non-teachers tend to be higher in wealthy countries, this 
creates upward pressure on education spending to maintain competitive compensation for teachers while main¬ 
taining palatable class sizes. Domestic research in the U.S. indicates benefits of smaller class sizes in lower grades 
(Baker, 2016). WoBmann and West (2006) find that across nations, smaller class sizes seem more important 
where teacher wages are low. 

Figure 14 shows the relationship (or, more accurately, the lack of a relationship) between relative teacher com¬ 
pensation and class sizes at the primary level. Nations to the left have relatively noncompetitive teacher wages; 
those to the right have relatively competitive teacher wages, outpacing those of non-teachers (in Korea, for ex¬ 
ample, teacher wages are 30 percent higher than non-teacher wages). Teacher wages in the U.S. are relatively non¬ 
competitive, both as a function of relatively low absolute wage and as a function of high (and more rapidly grow¬ 
ing) non-teacher wage (AUegreto, Corcoran & Mishel, 2011. Meanwhile, class size, the vertical dimension in the 
figure, in U.S. primary schools is relatively average. Class sizes in Chile, which also has relatively low teacher wag¬ 
es, are quite high. Korean primary class sizes are somewhat larger, but teacher wages much higher. 


Figure 14. Class Size (Primary) and Relative Teacher Wages 
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OECD (2014), Education at a Glance 2014: OECD Indicators, OECD Publishing, http://dx.doi.org/10.1787/eag-2014-en 
Table D2.1. Average class size, by type of institution and level of education (2012) & 

Table D3.2. Teachers' salaries relative to earnings for full-time, full-year workers with tertiary education (2012) 


18 For a particularly egregious application, see: http://www.economist.com/news/international/21616978-liigher-teacher-pay-and-smaller-classes-are-not- 
be st-education-policie s-new- school . 
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Figure 15 shows the relationship between lower secondary class sizes and relative salaries. Again, U.S. teacher 
wages are relatively noncompetitive. In lower secondary grades, U.S. class sizes are relatively large. The seem¬ 
ingly high per-pupil spending figure of the U.S. does not, therefore, translate into either competitive wages or 
small classes; instead, the U.S. has relatively noncompetitive wages and average to larger-than-average class siz¬ 
es. These indicators provide a more accurate characterization of our investment in schooling, relative to other 
nations, than comparisons of nominal dollar inputs. 


Figure 15. Class Size (Lower Secondary) and Relative Teacher Wages 
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OECD (2014), Education at a Glance 2014: OECD Indicators, OECD Publishing, http://dx.doi.org/10.1787/eag-2014-en 
Table D2.1. Average class size, by type of institution and level of education (2012) & 

Table D3.2. Teachers' salaries relative to earnings for full-time, full-year workers with tertiary education (2012) 


The apparently high spending level of the United States coupled with the apparently modest class sizes and low 
relative wages might lead one to assume that the U.S. simply isn’t getting resources into the classroom or that 
these findings do, in fact, reveal inefficient spending. On the one hand, it may be the case that classroom teach¬ 
er salary expenses are lower in the U.S. for reasons discussed previously: that the U.S. education expenditure 
embeds far more than just classroom salary expenses, including pension and health benefits, transportation, 
food and extra-curricular activities. But it is also important to understand that the U.S. has low relative wages 
for teachers in a very high GDP context, where the wages of non-teachers are high. In other words, given high 
GDP and non-teacher wages, higher spending may be needed in the U.S. to achieve both competitive wages 
and reasonable class sizes. 

Table 2 calculates the classroom salary per pupil, based on actual salary and class size data, for primary and low¬ 
er secondary grades and evaluates classroom salaries per pupil as a share of per-pupil spending. Table 2 shows 
that the U.S. share of spending in classroom salaries per pupil is relatively low in the primary level and lowest 
for lower secondary, but not substantially out of line with other countries for which data were available. 
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Figure 16 and Figure 17 provide some visual context for the values reported in Table 2. Notably, the relationship 
across countries between overall spending levels and classroom spending levels is strong and linear—vindicating 
that countries spending more overall are spending more on teacher salaries per classroom. This includes the 
United States, which falls below, but near the trendline for primary education, but further below the trendline 
(near Finland, however) for lower secondary education. Overall, the pattern is looser for lower secondary educa¬ 
tion (although the correlation is still strong), perhaps suggestive of more varied ranges of services and activities 
provided at this level. 
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Figure 16. Total per-Pupil and Estimated Classroom Salary Spending (Primary) 



Per Pupil Expenditure (Primary education) 


OECD (2014), Education at a Glance 2014: OECD Indicators, OECD Publishing, http://dx.doi.org/10.1787/eag-2014-en 
Table D2.1. Average class size, by type of institution and level of education (2012) & 

Table D3.2. Teachers' salaries relative to earnings for full-time, full-year workers with tertiary education (2012) 

Table D3.4. Average actual teachers’ salaries (2012) 
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Figure 17. Total per-Pupil and Estimated Classroom Salary Spending (Lower Secondary) 
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Figure 18 and Figure 19 show the relationship between relative poverty and class sizes across nations. Chile has 
particularly large class sizes and high relative poverty. It also has lower-than-average relative wages and performs 
poorly on PISA (Figure 4). Israel and Turkey also have high poverty and relatively large class sizes but more com¬ 
petitive wages than Chile or the U.S. For lower secondary grades, the U.S. joins these countries, having the com- 
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bination of high poverty and large class sizes, and joins Chile in also having low relative wages. While still a limited 
view on resources and context, one would not likely expect superior international test scores from a nation with 
noncompetitive compensation for its teachers, average to large classes, and high poverty. Perhaps the United 
States’ seemingly mediocre PISA scores are, in fact, in line with expectations, given our inputs and context. Per¬ 
haps we are not the model of inefficiency, but rather, about average—^in line with expectations, but nothing more 
and nothing less. 


Figure 18. Class Size (Primary) and Relative Poverty 
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CONCLUSIONS AND IMPLICATIONS 


What does this all mean? First and foremost, we can say with some confidence that existing expositions of U.S. 
inefficiency based on OECD national spending data and PISA scores are so lacking in methodological rigor 
that they are of little if any value in public discourse or for informing national education policies. Second, it is 
unHkely that we could ever obtain data of sufficient precision, accuracy and comparability to meet the demands 
of more legitimate efficiency modeling for cross-national, intercontinental analyses. But that does not mean we 
can’t learn anything at all from available data; as long as we deal with them cautiously, understand that we are 
viewing moment-in-time snapshots of limited measures, and realize the extent of what’s missing from any such 
cross-national descriptive analysis, there are actually important insights to be gained from appropriate analyses 
of the international data. 

Among other things, the OECD per-pupil spending measure, as incomparable as it is, shows that the U.S. may 
have higher per-pupil spending than many nations, but falls right in line with expectations for nations of similar 
GDP per capita. The United States is both a high-spending and high-GDP country, but some of that high edu¬ 
cation spending may be a function of the scope of services and expenses included under the education umbrel¬ 
la in the U.S. We also know that despite its seemingly high spending levels, the United States’ teacher wages lag 
with respect to other professions, and the wage lag is not a result of providing relatively smaller class sizes. In 
fact, our primary class sizes are average and lower secondary class sizes large. Our wage lag is, to an extent, a 
function of high non-teaching wages (related to our high GDP per capita), necessarily making it more expen¬ 
sive to recruit and retain a high-quality teacher labor force. To summarize: The United States is faced with a 
combination of seemingly high education expense, but noncompetitive compensation for its teachers, average 
to large classes, and high child poverty. Again, it’s hard to conceive how such a combination would render the 
U.S. comparable in raw test scores to low-poverty nations like Korea or Finland, or small, segregated, homoge¬ 
neous enclaves like Singapore or Shanghai. 

If there exists any possible example of classic over-allocation (as usually ascribed to the United States), it might 
be found in Luxembourg, where poverty is very low, spending is high, class sizes are very small, and teacher 
wages are very competitive. It would be difficult for PISA scores to line up with this full set of contextual and 
resource advantages. But this finding for Luxembourg comports with public finance literature regarding ineffi¬ 
ciency: Countries that can afford to spend more are more likely to spend “inefficiently,” at least so far as can be 
measured in outcomes that only focus on math and language arts test scores. 

Finally, it is equally important to understand the magnitude and heterogeneity of the U.S. education system in 
the context of OECD comparisons, which mainly involve more centralized and much smaller education sys¬ 
tems. Lower-poverty, higher-spending states that have been included in international comparisons, like Con¬ 
necticut and Massachusetts, do quite well, while lower-spending higher-poverty states like Florida do not. This 
unsurprising finding, however, also tells us litde about relative efficiency, and provides little policy guidance for 
how we might make Florida more like Massachusetts, other than by waving a wand and making it richer, more 
educated and perhaps several degrees colder. 


i^Shanghai in particular has several mitigating factors that make comparing its scores to those of other nations highly suspect; see: 
http://www.brookings.edu/research/papers/2013/10/09-pisa-china-problem-loveless . 
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Appendix A. Alternative Unconditional, Nominal 

Following are additional variations of the "unconditional PISA production function" as discussed and presented 
in Figure 4. Each illustrates roughly the same idea - the supposed diminishing returns curve of national spending 
and outcomes. But each, like Figure 4 largely displays that wealthier nations simultaneously spend more and 
achieve higher test scores, without sorting out various intervening factors. 
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OECD (2010), PISA 2009 Results: What Students Know and Can Do - Student 

Performance in Reading, Mathematics and Science (Volume I) http://dx.doi.org/10.1787/9789264091450-en 
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